Using  Computational  Cognitive  Modeling  to  Predict 
Dual-Task  Performance  With  Sleep  Deprivation 

Glenn  Gunzelmann,  Air  Force  Research  Laboratory,  Mesa,  Arizona,  Michael  D.  Byrne, 
Rice  University,  Houston,  Texas,  Kevin  A.  Gluck,  Air  Force  Research  Laboratory, 
Mesa,  Arizona,  and  L.  Richard  Moore  Jr.,  Lockheed  Martin,  Mesa,  Arizona 


Objective:  The  effects  of  fatigue  on  multiple-task  performance  were  explored  through 
computational  cognitive  modeling.  Background:  Fatigue  typically  has  a  negative  impact 
on  human  performance.  Biomathematical  models  exist  that  characterize  the  dynamics  of 
human  alertness,  but  the  link  between  alertness  and  in  situ  performance  on  specific  tasks  is 
tenuous.  Cognitive  architectures  offer  a  principled  means  of  establishing  that  link.  Method: 
We  implemented  mechanisms  for  fatigue,  which  produce  microlapses  in  cognitive  process¬ 
ing,  into  an  existing  model,  adaptive  control  of  thought-rational,  and  validated  the  perfor¬ 
mance  predictions  with  Bratzke,  Rolke,  Ulrich,  and  Peters’  data  on  fatigue  and  multiple-task 
performance.  Results:  The  microlapse  model  replicated  the  human  performance  results  very 
well  with  zero  free  parameters,  although  the  fit  was  improved  when  we  allowed  two  indi¬ 
vidual  differences  parameters  to  vary.  Conclusion:  Increased  frequency  of  microlapses  as  a 
result  of  fatigue  provides  a  parsimonious  explanation  for  the  impact  of  fatigue  on  dual-task 
performance  and  is  consistent  with  previous  research.  Application:  Our  results  illustrate 
how  using  biomathematical  models  of  fatigue  in  conjunction  with  a  cognitive  architec¬ 
ture  can  result  in  accurate  predictions  of  the  effects  of  fatigue  on  dual-task  performance. 
Extending  and  generalizing  this  capability  has  potential  utility  in  any  safety-critical  domain 
in  which  fatigue  may  affect  performance. 


Cognitive  impairments  resulting  from  fatigue 
have  critical  implications  in  applied  settings. 
Explanations  for  numerous  industrial  and  com¬ 
mercial  disasters,  not  to  mention  more  com¬ 
mon  tragedies,  such  as  car  accidents  (Gallup, 
2002;  Home  &  Reyner,  1999;  Klauer,  Dingus, 
Neale,  Sudweeks,  &  Ramsey,  2006;  National 
Transportation  Safety  Board,  1995;  Pack  et  al., 
1 995),  have  implicated  fatigue  as  a  contributing 
factor  (Caldwell,  2003;  Dinges,  1995;  Mitler 
et  al.,  1 988).  Consequently,  much  of  the  research 
conducted  on  fatigue  is  directed  at  understand¬ 
ing  the  decrements  arising  from  restricted  sleep, 
extended  periods  of  wakefulness,  and  circadian 
rhythmicity.  An  overarching  goal  is  to  come  to 
an  understanding  of  how  sleep  affects  human 
cognition  that  is  useful  for  making  predictions 
about  human  performance  in  applied  settings 
and  thereby  assist  in  planning  and  decision 


making  (Dinges  &  Kribbs,  1991;  Klerman  & 
St.  Hilaire,  2007).  Because  of  the  potential  value 
of  the  research  in  these  contexts,  there  is  a  long 
history  of  empirical  and  theoretical  research, 
despite  the  cost  and  logistical  issues  associated 
with  conducting  controlled  studies  of  fatigue. 

In  order  to  use  research  results  from  sleep 
studies  to  make  predictions  about  task  perfor¬ 
mance,  one  must  understand  two  theoretical 
aspects  of  the  issue.  One  necessary  component 
is  an  accurate  understanding  of  the  endog¬ 
enous  and  exogenous  factors  that  influence 
overall  level  of  alertness  and  how  they  interact 
to  produce  cognitive  performance.  To  address 
this  issue,  biomathematical  models  have  been 
developed  that  integrate  mechanisms  for  time 
awake  and  circadian  rhythms  in  estimating  an 
overall  level  of  cognitive  functioning,  or  alert¬ 
ness  (see  Klerman  &  St.  Hilaire,  2007;  Mallis, 
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Mejdal,  Nguyen,  &  Dinges,  2004).  These  mod¬ 
els  embody  the  current  understanding  of  the 
dynamics  of  alertness  and  some  of  the  internal — 
physiological  or  biological — influences  on  over¬ 
all  cognitive  performance. 

Biomathematical  models  have  been  incorpo¬ 
rated  into  tools  that  can  be  used  to  create  and 
assess  work-rest  schedules  (Dean,  Fletcher, 
Flursh,  &  Klerman,  2007;  Flursh,  Raslear,  Kaye, 
&  Fanzone,  2006).  Despite  their  sophistication, 
however,  the  models  are  limited  to  making  gen¬ 
eral  predictions  about  relative  alertness.  Their 
output  can  be  scaled  to  existing  human  perfor¬ 
mance  data  to  fit  particular  measures,  but  this 
must  be  done  in  a  post  hoc  manner  (e.g.,  Van 
Dongen,  2004)  because  the  models  predict  rela¬ 
tive  alertness,  not  cognitive  performance  and 
behavior.  This  undermines  a  key  application 
goal  for  such  tools,  which  is  to  make  predictions 
about  the  consequences  of  fatigue  in  particular 
task  contexts  when  behavior  and  performance 
data  are  unavailable  in  real-world  settings. 

The  limitations  of  biomathematical  mod¬ 
els  point  to  the  other  necessary  component  of 
a  predictive  model  of  fatigue.  Specifically,  in 
addition  to  understanding  the  dynamics  of  per¬ 
formance  overall,  it  is  also  necessary  to  under¬ 
stand  the  mechanisms  through  which  fatigue 
operates  to  affect  cognitive  processing  and, 
ultimately,  behavior  and  performance.  Among 
the  theoretical  accounts  that  have  been  devel¬ 
oped  in  the  sleep  research  community,  a  com¬ 
mon  perspective  is  that  fatigue  has  two  primary 
impacts  on  cognitive  performance:  a  general¬ 
ized  cognitive  slowing  that  affects  performance 
in  short-reaction-time  tasks  combined  with  an 
increased  probability  of  cognitive  lapses,  which 
refers  to  breakdowns  in  cognitive  process¬ 
ing  lasting  on  the  order  of  seconds  (Dinges  & 
Kribbs,  1991;  Doran,  Van  Dongen,  &  Dinges, 
2001;  Fleuer,  Kleinsorge,  Klein,  &  Kohlisch, 
2004).  Flowever,  these  accounts  are  merely 
descriptive;  they  have  little  to  say  about  the 
cognitive  mechanisms  underlying  fatigue  and 
how  those  mechanisms  interact  with  others  to 
produce  decrements  in  cognitive  performance. 

MODELING  APPROACH 

Our  research  is  positioned  at  the  intersection 
of  relative  alertness  predictions  and  quantitative 


performance  predictions.  Specifically,  we  use 
computational  cognitive  models  that  generate 
quantitative  performance  predictions  in  specific 
task  contexts.  The  models  are  implemented  in  a 
cognitive  architecture,  which  represents  a  theory 
of  the  information-processing  mechanisms  that 
underlie  human  cognitive  capabilities  (Newell, 
1990).  By  using  a  cognitive  architecture,  one 
can  implement  theories  of  fatigue  as  influences 
on  those  computational  mechanisms  that  lead  to 
identifiable  changes  in  the  behavior  of  models. 
This  involves  manipulating  parameters  in  the 
cognitive  architecture  related  to  aspects  of  its 
information-processing  activity.  Subsequently, 
when  one  links  the  dynamics  of  those  param¬ 
eters  to  the  biomathematical  models  of  alertness 
described  earlier,  it  is  possible  to  create  a  bridge 
between  understanding  the  dynamics  of  fatigue 
and  understanding  the  impact  of  those  dynamics 
on  cognitive  processing  and  human  behavior. 

We  have  already  demonstrated  the  value  of 
this  integration  for  generating  principled,  quan¬ 
titative  accounts  of  fatigued  performance  (e.g., 
Gunzelmann,  Gluck,  Kershner,  Van  Dongen, 
&  Dinges,  2007;  Gunzelmann,  Gross,  Gluck,  & 
Dinges,  in  press).  In  this  article,  we  explore  extend¬ 
ing  such  accounts  to  novel  task  contexts  with  the 
goal  of  making  principled,  a  priori  predictions. 

The  cognitive  architecture  that  we  are  using  is 
ACT-R,  or  adaptive  control  of  thought-rational 
(Anderson  et  al.,  2004).  ACT-R  is  a  computa¬ 
tional  theory  of  human  cognition  and  performance 
that  has  been  used  to  simulate  human  behavior 
in  a  wide  variety  of  tasks  (see  Anderson  et  al, 
2004,  and  Anderson  &  Lebiere,  1998,  for  par¬ 
tial  reviews).  ACT-R  consists  of  multiple  mod¬ 
ules  (e.g.,  cognitive,  motor,  visual)  that  can  act 
in  parallel.  These  modules  communicate  with 
central  cognition  through  a  set  of  buffers  contain¬ 
ing  information  about  each  module’s  processing 
(e.g.,  the  visual  module  has  two  buffers  that  hold 
information  about  what  is  currently  seen  and 
where  it  is). 

Central  cognition  in  ACT-R  is  represented 
as  a  production  system  in  which  the  patterns 
instantiated  across  the  set  of  buffers  lead  to  cog¬ 
nitive,  perceptual,  or  motor  actions.  Many  of 
ACT-R’s  modules  have  serial  constraints;  most 
relevantly,  the  procedural  module  in  ACT-R 
can  execute  only  a  single  production  on  a  given 
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cognitive  cycle,  which  represents  a  central  bot¬ 
tleneck  in  cognitive  processing.  Additionally,  all 
of  the  symbolic  architectural  mechanisms  have 
associated  numerical  quantities  that  allow  for 
continuous,  graded  changes  in  cognitive  perfor¬ 
mance,  a  key  feature  of  the  degradations  associ¬ 
ated  with  fatigue.  Further  details  about  ACT-R 
are  available  elsewhere  (e.g.,  Anderson,  2007). 

The  biomathematical  model  we  use  in  the 
research  described  here  is  referred  to  as  the  circa¬ 
dian  neurobehavioral  performance  and  alertness 
model,  or  CNPA  (Jewett  &  Kronauer,  1999).  It 
produces  an  estimate  of  overall  cognitive  perfor¬ 
mance,  identified  as  cognitive  throughput  in  the 
model.  The  predictions  take  into  account  the  influ¬ 
ences  of  a  sleep  homeostat  system  and  circadian 
rhythms,  instantiating  the  widely  accepted  two- 
process  model  of  human  alertness  (Achermann, 
2004;  Borbely,  1982).  In  addition,  CNPA  has 
mechanisms  that  represent  the  impact  of  light  on 
the  phase  and  amplitude  of  the  circadian  pace¬ 
maker  and  a  sleep  inertia  process,  which  accounts 
for  reduced  levels  of  alertness  when  individuals 
initially  awaken.  The  model  has  been  shown  to 
capture  the  dynamics  of  alertness  as  reflected  by  a 
variety  of  behavioral  and  physiological  measures 
(e.g.,  Jewett  &  Kronauer,  1 999). 

We  have  used  previous  research  to  establish 
a  link  between  biomathematical  model  predic¬ 
tions  of  alertness  and  parameters  in  ACT-R  (e.g., 
Gunzelmann  et  al.,  2007;  in  press).  The  results 
have  shown  that  a  simple  linear  mapping  of 
alertness  predictions  to  specific  parameter  val¬ 
ues  in  ACT-R  produces  changes  in  model  per¬ 
formance  that  are  well  aligned  with  changes  that 
are  observed  in  human  performance  as  fatigue 
levels  increase.  In  the  next  section,  we  describe 
a  dual-task  paradigm  and  a  data  set  that  has  pro¬ 
vided  an  opportunity  to  conduct  an  evaluation  of 
some  of  the  mechanisms  we  have  proposed. 

TASK  CONTEXT  AND  RELEVANCE 

Dual-tasking  is  a  fundamental  skill  in  natural¬ 
istic  settings.  Perhaps  the  most  notorious  mod¬ 
ern  example  of  dual-tasking  is  dialing  and  using 
a  cell  phone  while  driving  (e.g.,  Salvucci,  2006), 
but  there  are  countless  more  mundane  examples, 
such  as  reading  while  watching  TV,  listening  to 
music  while  typing,  or  checking  e-mail  during  a 
meeting  or  presentation.  Dual-tasking  situations 


require  the  management  of  cognitive  resources 
and  the  effective  execution  of  actions  in  the 
service  of  each  task.  It  is  commonly  observed 
that  performance  on  one  or  both  tasks  degrades 
when  they  are  performed  together  versus  when 
each  is  done  in  isolation.  In  laboratory  settings, 
dual-task  paradigms  provide  an  opportunity  to 
examine  in  detail  issues  related  to  distributing 
cognitive  resources  to  perform  multiple  tasks 
simultaneously.  When  one  understands  these 
details,  a  better  understanding  of  how  more  com¬ 
plex  tasks  are  interleaved  in  naturalistic  contexts 
can  emerge. 

Research  on  dual  tasking  has  often  looked 
at  a  phenomenon  referred  to  as  the  psycho¬ 
logical  refractory  period,  or  PRP  (Meyer  & 
Kieras,  1997a;  1997b;  Pashler,  1994;  Pashler  & 
Johnston,  1989).  In  a  typical  study,  participants 
are  presented  with  two  tasks,  each  of  which  is 
a  two-choice  reaction  time  task.  Critically,  the 
stimulus  for  Task  2  is  presented  at  some  delay, 
or  stimulus  onset  asynchrony  (SOA),  relative  to 
Task  1.  Across  a  variety  of  circumstances,  reac¬ 
tion  times  for  Task  2  tend  to  increase  when  the 
SOA  is  short.  The  PRP  effect  is  defined  as  the 
difference  between  reaction  time  in  a  short-SOA 
condition  (50  ms  in  the  experiment  described 
later)  and  the  reaction  time  when  the  SOA  is 
long  (1,000  ms  in  the  experiment). 

Pashler  (1994)  postulated  that  the  PRP 
effect  reflects  a  central  information-processing 
bottleneck  that  imposes  a  serial  constraint  on 
cognitive  activity.  However,  alternative  pro¬ 
posals  suggest  that  the  PRP  effect  is  actually  a 
function  of  strategies  adopted  by  participants 
to  meet  the  requirement  that  the  response  for 
Task  1  be  issued  first.  This  latter  explana¬ 
tion  has  been  referred  to  as  strategic  response 
deferment  (Meyer  &  Kieras,  1997a,  1997b). 
Distinguishing  between  these  alternatives  has 
potentially  important  implications  in  complex 
applied  tasks  with  time-critical  components. 
Byrne  and  Anderson  (200 1 )  presented  a  compu¬ 
tational  cognitive  model  that  incorporates  both 
a  cognitive  bottleneck  and  strategic  response 
deferment,  which  provided  a  close  fit  to  human 
performance  data  across  multiple  experimental 
variations.  This  model  provides  the  foundation 
for  the  current  effort  to  validate  a  computational 
account  of  fatigue. 
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The  data  set  for  this  evaluation  comes  from 
an  experiment  described  in  Bratzke,  Rolke, 
Ulrich,  and  Peters  (2007),  who  used  a  task 
based  on  the  paradigm  described  in.  In  the 
Bratzke  et  al.  study.  Task  1  required  partici¬ 
pants  to  respond  to  an  auditory  signal  (high 
versus  low  tone),  whereas  Task  2  required 
a  response  to  a  visual  stimulus  (X  versus  O ).  Task 

1  responses  were  left-hand  key  presses  and  Task 

2  responses  were  right-hand  key  presses.  The 
three  SOAs  used  by  Bratzke  et  al.  (2007)  were  50, 
200,  and  1 ,000  ms.  Participants  ( n  =  6)  completed 
108  trials,  broken  into  three  blocks  of  36,  every 
2  hr  during  28  hr  of  continuous  wakefulness. 

The  data  from  Bratzke  et  al.  (2007)  are  pre¬ 
sented  in  Figure  1  (solid  black  lines).  Parti¬ 
cipants  showed  a  gradual  speed-up  on  both  tasks 
early  on,  which  is  attributable  to  learning.  In 
contrast,  during  the  overnight  hours,  reaction 
times  increased  substantially  for  both  tasks 
(see  Figures  la  and  lb).  Figure  lb  shows  reac¬ 
tion  times  for  Task  2  as  a  function  of  the  SOA. 
Although  reaction  time  increases  regardless  of 
the  SOA,  it  increases  more  when  the  SOA  is 
shorter,  which  is  reflected  in  an  increase  in  the 
PRP  effect  (Figure  lc).  The  overall  pattern  of 
results  is  consistent  with  a  substantial  body  of 
research  investigating  the  interaction  between  sleep 
homeostatic  processes  and  circadian  rhythms. 

The  experiment  and  data  offer  an  exciting 
opportunity  to  evaluate  our  capacity  to  make 
precise  predictions  regarding  the  effect  of 
fatigue.  Dual-task  performance  is  an  increas¬ 
ingly  important  measure  as  technology  adds 
complexity  and  distraction  to  everyday  activi¬ 
ties.  Simultaneously,  continuous  operations  and 
activity  force  individuals  to  function  and  per¬ 
form  with  inadequate  sleep  and  at  times  when  the 
circadian  system  is  applying  pressure  to  sleep. 
Unfortunately,  research  labs  that  are  equipped 
to  conduct  research  on  fatigue — typically  in  a 
hospital  setting  with  24-hr  staffing — often  focus 
on  physiological  measures  to  the  exclusion  of 
detailed  performance  data.  This  limits  opportu¬ 
nities  to  carefully  evaluate  theoretical  accounts 
in  sufficient  detail.  Thus,  we  feel  that  the  theo¬ 
retical  value  of  the  opportunity  afforded  by  the 
detailed  performance  data  reported  in  Bratzke 
et  al.  (2007)  outweighs  concerns  regarding 
methodological  details  in  their  study,  including 


the  relatively  small  sample,  which  actually  is 
fairly  typical  of  sleep  deprivation  studies.  Some 
limitations  are  discussed  later,  and  we  pro¬ 
vide  illustrations  of  the  utility  of  our  modeling 
approach  as  well  as  cautions  regarding  potential 
influences  on  human  performance. 

In  discussing  the  degradations  in  perfor¬ 
mance  that  were  observed  as  fatigue  increased, 
Bratzke  et  al.  (2007)  focused  on  the  cognitive 
slowing  account  of  fatigue.  Indeed,  a  gener¬ 
alized  slowdown  does  seem  to  characterize 
the  observed  changes  in  human  performance 
appropriately  as  alertness  declines.  In  contrast 
to  this  perspective,  however,  we  have  proposed 
an  account  of  fatigue  based  on  microlapses, 
which  are  very  brief  gaps  in  cognitive  process¬ 
ing  (approximately  50  ms).  The  brief  duration 
of  microlapses  makes  them  unlike  the  cogni¬ 
tive  lapses  mentioned  earlier,  which  have  been 
characterized  as  lasting  from  several  to  10s  of 
seconds  (e.g.,  Doran  et  al.,  2001). 

A  feature  of  our  account  is  that  micro¬ 
lapses  can  produce  both  slowed  reactions  and 
response  lapses,  depending  on  their  probability 
(Gunzelmann  et  al.,  in  press).  In  this  account, 
microlapses  are  the  result  of  decreased  alert¬ 
ness,  which  interferes  with  the  production 
execution  process  in  ACT-R’s  procedural  sys¬ 
tem.  To  evaluate  this  account  in  the  context 
of  dual-task  performance,  we  applied  these 
mechanisms  directly  to  the  model  presented 
in  Byme  and  Anderson  (2001).  The  resulting 
model  was  used  to  generate  predictions  regard¬ 
ing  the  impact  of  fatigue  on  performance  in  the 
dual-task  experiment  reported  in  Bratzke  et  al. 
(2007).  The  model  and  the  fatigue  mechanisms 
are  described  in  the  next  section. 

ACT-R  MODEL  OF  DUAL-TASK 
PERFORMANCE 

The  ACT-R  model  of  the  PRP  we  used  is 
nearly  identical  to  the  one  presented  in  Byme 
and  Anderson  (2001);  only  minor  changes  in 
the  model’s  knowledge  were  required  to  accom¬ 
modate  the  differences  in  stimuli  between  the 
experiments.  The  model  contains  a  small  num¬ 
ber  of  productions,  which  implement  a  simple 
perceive-decide-act  strategy  for  both  tasks. 
Because  of  ACT-R’s  cognitive  bottleneck,  the 
decision  phase  of  Task  2  may  have  to  wait  for 
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Figure  1.  Human  data  from  Bratzke,  Rolke,  Ulrich,  and  Peters  (2007)  and  performance  of  the  ACT-R  model 
with  and  without  parameter  variation:  (a)  reaction  time  (ms)  for  Task  1,  (b)  reaction  time  (ms)  for  Task  2  as 
a  function  of  the  stimulus  onset  asynchrony  (SOA),  and  (c)  magnitude  of  the  psychological  refractory  period 
effect  (ms),  computed  by  subtracting  mean  reaction  for  Task  2  at  1,000  ms  SOA  from  the  mean  reaction  time  for 
Task  2  at  50  ms  SOA.  In  all  figures,  human  data  are  represented  by  solid  black  lines.  Gray  lines  illustrate  zero 
free  parameter  predictions,  and  short  dashed  lines  show  model  performance  when  arousal  (G)  is  allowed  to  vary. 
The  human  empirical  data  (solid  lines)  used  in  this  evaluation  are  from  “Central  Slowing  During  the  Night,”  by 
Bratzke  et  al.,  2007,  Psychological  Science,  18(5),  p.  459,  Figure  2.  Copyright  2007  by  Blackwell.  Used  with  per¬ 
mission.  All  rights  reserved. 


Task  1  processing.  Because  of  noise  in  the  dura¬ 
tion  of  the  output  stages  (i.e.,  button  presses), 
the  model  defers  the  response  for  Task  2  until 
the  response  for  Task  1  has  been  initiated,  which 
is  done  to  guarantee  that  responses  are  emitted 
in  order.  This  is  similar  to  the  double-bottleneck 
model  of  De  Jong  (1993)  except  that  the  second 
bottleneck  is  strategic  (knowledge  based)  rather 
than  structural  (i.e.,  an  architectural  constraint), 
which  is  similar  to  the  models  proposed  by 
Meyer  and  Kieras  ( 1 997a,  1 997b).  The  first  bot¬ 
tleneck  is  the  serial  production  system,  which 
represents  an  architectural  constraint  on  central 
cognitive  processing  in  ACT-R.  The  PRP  effect 
occurs  if  either  (or  both)  of  these  bottlenecks 
are  busy  when  the  Task  2  stimulus  is  presented 


at  short  SOAs,  whereas  the  bottlenecks  are 
free  at  longer  SOAs.  The  PRP  model  is  described 
in  more  detail  in  Byrne  and  Anderson  (200 1 ). 

Mechanisms  for  Fatigue 

The  model  from  Byme  and  Anderson  (2001) 
provides  an  a  priori  account  of  human  perfor¬ 
mance  on  the  dual-task  paradigm  in  Bratzke 
et  al.  (2007).  To  make  predictions  about  the 
effects  of  fatigue,  one  must  augment  this 
model  with  mechanisms  that  account  for  deg¬ 
radations  in  cognitive  performance  that  are 
associated  with  extended  wakefulness.  This 
involved  implementing  the  microlapse  account 
from  Gunzelmann  et  al.  (in  press)  in  the  PRP 
model  from  Byme  and  Anderson  (200 1 ).  In  this 
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integrated  model,  slower  reaction  times,  such  as 
those  observed  during  the  overnight  period  in 
Bratzke  et  al.  (2007),  result  from  small  numbers 
of  microlapses  that  interfere  with  task  perfor¬ 
mance  in  the  model. 

Microlapsing.  The  mechanisms  that  produce 
microlapses  in  our  model  were  taken  directly 
from  the  model  in  Gunzelmann  et  al.  (in  press) 
that  performs  the  Psychomotor  Vigilance 
Test  (PVT),  a  sustained-attention  task  used 
extensively  by  sleep  researchers  investigating 
the  impact  of  fatigue  on  cognitive  performance 
(e.g.,  Dinges  &  Powell,  1985;  Doran  et  al., 
2001).  Micro  lapses  are  implemented  through  a 
set  of  mechanisms  that,  with  increasing  levels  of 
fatigue,  increases  the  likelihood  that  a  production 
execution  cycle  will  occur  in  which  no  actions 
are  performed.  This  happens  when  the  expected 
utilities  ( U .)  of  the  applicable  productions  fail  to 
exceed  the  threshold  for  execution,  referred  to 
as  the  utility  threshold  (T).  This  circumstance 
produces  a  microlapse,  instantiated  as  a 
production  execution  cycle  in  which  no  cognitive 
actions  are  perfonned  (lasting  approximately  50 
ms).  Noise  (e)  in  the  utility  computation  (see 
Equation  1)  makes  it  possible  that  no  production 
will  fire  on  a  particular  cycle  (because  no 
production  exceeds  the  threshold),  followed  by 
a  cycle  in  which  at  least  one  production  does 
exceed  T  and  fires. 

u 

U.  =  P.G—C.  +  £  (1) 

The  frequency  of  microlapses  in  the  model 
is  a  function  of  the  G  parameter  from  Equation 
1  and  the  utility  threshold  (T).  As  fatigue 
levels  increase,  G  is  reduced  to  represent 
decreased  alertness.  Based  on  evidence  for 
compensatory  behaviors  on  the  part  of  partici¬ 
pants  to  offset  the  deleterious  effects  of  sleep 
deprivation  (e.g.,  Doran  et  al.,  2001;  Portas 
et  al.,  1998),  we  also  reduced  the  utility  thresh¬ 
old  over  time,  which  made  it  more  likely  that 
some  production  will  fire. 

The  dynamics  of  both  of  these  parameters 
have  been  linked  to  predictions  of  alertness 
from  biomathematical  models  was  used  in  which 
a  linear  scaling  that  required  estimating  a  slope  and 
an  intercept  for  each  mapping  function.  In  Equation 
1,  P  and  C  are  production-specific  parameters 
representing  the  probability  and  anticipated  cost 
(in  time),  respectively,  of  achieving  the  goal 


using  that  production.  Default  values  of  P  =  1 
and  C  —  50  ms  were  used  for  all  productions  in 
this  model. 

The  fatigue  model  also  includes  a  mechanism 
that  dynamically  reduces  arousal  ( G )  during 
a  trial.  This  mechanism  reflects  an  assumption 
that  microlapses  are  indicative  of  the  process 
of  falling  asleep.  Thus,  G  is  decremented  by  a 
small  amount  each  time  a  microlapse  occurs. 
When  a  new  stimulus  is  presented,  G  returns  to 
its  initial  value  at  the  start  of  each  trial,  which 
represents  the  arousing  impact  of  environmental 
stimulation  (e.g.,  Pilcher,  Band,  Odle-Dusseau, 
&  Muth,  2007). 

Gunzelmann  et  al.  (in  press)  provided  a  more 
thorough  description  of  the  mechanisms  and 
parameters  as  well  as  an  evaluation  of  the  abil¬ 
ity  of  these  mechanisms  to  capture  changes  in 
sustained-attention  perfonnance  stemming  from 
fatigue.  Elere  we  have  applied  these  mechanisms 
directly  to  the  model  of  the  PRP  from  Byme  and 
Anderson  (2001)  to  generate  predictions  about 
performance  for  the  Bratzke  et  al.  (2007)  study. 

MODEL  EVALUATION 

The  direct  integration  of  a  set  of  mecha¬ 
nisms  to  account  for  fatigue  into  an  existing 
model  of  dual-task  performance  offers  a  rela¬ 
tively  rare  opportunity  in  computational  cog¬ 
nitive  modeling  research — the  ability  to  make 
truly  a  priori  quantitative  predictions  regard¬ 
ing  performance.  That  is,  by  using  CNPA  to 
estimate  alertness  in  the  current  study,  and 
using  the  same  linear  function  to  map  alert¬ 
ness  predictions  to  ACT-R  parameter  values 
as  estimated  in  Gunzelmann  et  al.  (in  press), 
we  were  able  to  generate  model  performance 
predictions  with  zero  free  parameters  using  the 
model  from  Byme  and  Anderson  (2001).  The 
gray  lines  in  Figure  1  present  such  predictions 
for  the  dual-task  paradigm  described  in  Bratzke 
et  al.  (2007),  aggregated  across  12,000  simu¬ 
lated  PRP  trials  (approximately  111  simulated 
participants  in  the  procedure  used  in  the  original 
study).  The  model  captures  the  qualitative  trends 
quite  well,  with  worsening  performance  during 
the  overnight  hours  and  some  recovery  during 
the  subsequent  morning.  However,  the  model  is 
substantially  faster  than  the  humans  in  Task  1 ,  and 
the  magnitude  of  the  model’s  changes  in  Task  2 
performance  clearly  is  not  as  large  as  in  the 
human  data. 
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Figure  2.  Values  for  arousal  (G)  and  utility  threshold  (Tu)  for  the  model  across  28  hr  of  sleep  deprivation.  Unmodified 
values  for  G  were  estimated  using  the  results  from  Gunzelmann  et  al.  (in  press).  Varied  values  for  G  were  estimated 
by  finding  the  best-fitting  line  mapping  biomathematical  model  predictions  of  alertness  to  G  values  for  the  current 
experiment.  In  both  cases,  T  values  were  derived  from  Gunzelmann  et  al  (in  press). 


There  are  a  number  of  potential  explanations  for 
the  discrepancy  in  the  impact  of  fatigue,  including 
individual  differences  and  methodological  differ¬ 
ences  related  to  the  task  and/or  sleep  deprivation 
protocol.  Of  these,  we  note  that  task  character¬ 
istics  have  been  found  to  influence  motivation 
and  performance  in  important  ways  (Caldwell  & 
Ramspott,  1998;  Drummond,  Brown,  Salamat, 
&  Gillin,  2004;  Pilcher  et  al.,  2007).  That  is,  task 
characteristics  can  influence  the  severity  of  the 
decrements  associated  with  fatigue  by  affecting 
factors,  such  as  motivation,  that  influence  alert¬ 
ness.  In  the  context  of  the  microlapse  hypothesis, 
this  implies  that  different  values  for  G  may  be  nec¬ 
essary  to  capture  differences  in  the  task  itself. 

Figure  1  also  presents  data  from  the  model 
when  the  relationship  of  G  to  alertness  values 
from  CNPA  was  allowed  to  vary  (dashed  lines; 
again  aggregated  across  12,000  trials).  In  this 
case,  the  model  makes  much  more  accurate  pre¬ 
dictions  regarding  the  changes  in  Task  2  perfor¬ 
mance  that  are  observed  during  the  course  of  28 
hr  without  sleep. 

Even  with  variations  to  G,  the  theoretical 
account  is  entirely  consistent  with  the  account 
in  Gunzelmann  et  al.  (in  press)  for  sustained- 
attention  performance.  In  addition,  the  dynamics 
of  G  across  sessions  are  still  tied  to  the  predic¬ 
tions  of  the  biomathematical  model.  By  varying 


G,  we  are  introducing  two  free  parameters: 
the  slope  and  the  intercept  of  the  linear  equa¬ 
tion  mapping  CNPA  predictions  of  alertness  to 
values  of  G  in  ACT-R. 

The  values  of  G  using  both  approaches  are 
shown  in  Figure  2,  along  with  the  values  for  T , 
which  were  the  same  in  both  cases.  The  varia¬ 
tions  of  G  relative  to  T  affect  the  likelihood  of 

u 

occurrences  of  microlapses.  In  the  best  perfor¬ 
mance  session  predicted  by  CNPA  (Session  5; 
following  9  hr  awake),  the  model  in  which  G 
was  varied  produced  an  average  of  5.4  micro¬ 
lapses  per  trial,  although  the  median  (2)  and  the 
mode  (0)  are  more  appropriate,  given  that  the 
distribution  is  highly  skewed.  After  25  hr  awake, 
the  average  soars  to  18.3,  with  corresponding 
increases  in  the  median  (10)  and  mode  (2). 

Figure  1  illustrates  how  the  likelihood  of 
microlapses  affects  overall  performance.  Res¬ 
ponse  times  increase,  and  the  PRP  effect  is 
magnified.  The  modified  estimates  for  G  improve 
the  model’s  correspondence  with  the  data 
(Table  1),  although  the  large  discrepancy  between 
the  model  and  human  data  on  Task  1  performance 
remains.  This  disparity  appears  to  result  from 
the  instructions  given  to  participants  in  Bratzke 
et  al.’s  (2007)  study.  The  ACT-R  model  is  based 
on  task  instructions  that  were  carefully  crafted  to 
produce  compliance  with  the  PRP  assumptions 
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TABLE  1:  Comparison  of  the  Model's  Performance 
to  the  Human  Data  Using  Root  Mean  Squared 
Deviation  (RMSD)  and  Correlation  (r) 


Microlapsing 


Zero  Free 

Parameters 

Vary 

Arousal 

Variable 

RMSD 

r 

RMSD 

r 

Task  1 

168  ms 

.81 

149  ms 

.84 

average  RT 
Task  2 

42  ms 

.96 

32  ms 

.98 

average 

RT  (x  SOA) 
PRP  effect 

25  ms 

.76 

34  ms 

.76 

Note.  RT  =  reaction  time;  PRP  =  psychological  refractory 
period. 

(see  Schumacher  et  al,  1999).  These  instructions 
emphasize  giving  priority  to  Task  1  and  respond¬ 
ing  to  Task  1  first  in  addition  to  responding  as 
quickly  as  possible  to  both  tasks. 

In  contrast,  Bratzke  et  al.  (2007)  instructed 
participants  to  “give  the  same  attention  to  each 
of  the  two  tasks”  and  stated  that  “both  tasks 
are  equally  important”  (Bratzke,  personal  com¬ 
munication,  October  7,  2008).  Such  changes 
to  the  instructions  can  have  important  conse¬ 
quences  for  performance.  In  fact,  when  Pashler 
and  Johnston  (1989)  instructed  participants  to 
“produce  responses  in  a  ‘nice  orderly  rhythm’” 
(p.  36),  Task  1  reaction  times  were  significantly 
longer  than  when  standard  PRP  instructions 
were  given.  Interestingly,  there  was  little  impact 
of  the  instructional  manipulation  on  Task  2 
reaction  times,  which  appears  to  correspond  to 
the  current  empirical  data  based  on  the  contrast 
with  the  model  predictions. 

Our  goal  was  not  to  perform  a  detailed  evalu¬ 
ation  of  the  strategy  variations  that  are  possible 
in  the  PRP.  Nonetheless,  it  may  be  the  case  that 
the  Byme  and  Anderson  (2001)  model  does  not 
capture  precisely  the  strategy  used  by  partici¬ 
pants  in  Bratzke  et  al.  (2007)  because  of  instruc¬ 
tional  differences.  The  more  important  question, 
however,  is  whether  this  discrepancy  in  task 
strategy  has  an  impact  on  the  validity  of  the  con¬ 
clusions  made  regarding  the  impact  of  fatigue. 
We  believe  that  it  does  not.  The  reason  is  that 
these  strategy  variations  involve  mostly  altering 


the  precise  manner  in  which  the  tasks  are  inter¬ 
leaved  (Pashler  &  Johnston,  1989).  Thus,  the 
primary  impact  should  be  on  how  the  two  tasks 
are  prioritized  rather  than  on  altering  the  fun¬ 
damental  processing  that  participants  perform. 
Consequently,  varying  the  model  to  reflect  these 
strategic  choices  would  not  substantially  alter 
the  predictions  regarding  the  impact  of  fatigue. 

CONCLUSION 

At  a  qualitative  level,  it  is  well  understood 
that  fatigue  leads  to  worse  performance  in  a 
variety  of  tasks.  However,  in  any  safety-critical 
domain  in  which  fatigue  is  a  factor,  it  is  obviously 
useful  to  have  more  precision.  Recently,  biologi¬ 
cally  and  physiologically  inspired  mathematical 
models  have  been  generated  that  make  quanti¬ 
tative  predictions  about  alertness.  To  bridge  the 
gap  from  these  models  to  predictions  about  task 
performance,  we  have  developed  a  model  that 
instantiates  alertness  in  a  way  that  captures  its 
effect  on  cognitive  processes.  We  have  demon¬ 
strated  the  predictive  capacity  of  this  model  in 
the  context  of  dual  tasking  and  the  PRP. 

Decrements  in  a  parameter  associated  with 
alertness  in  ACT-R  led  to  microlapses,  which 
increased  reaction  times  for  both  tasks  and  led 
to  larger  PRP  effects  corresponding  to  observed 
changes  in  human  performance.  Some  discrep¬ 
ancies  in  the  fit  of  the  model  to  the  human  data 
illustrate  areas  for  future  research,  such  as  the 
particular  dynamics  of  alertness  as  predicted  by 
the  biomathematical  model  and  the  potential 
influence  of  strategy  on  performing  the  dual-task 
paradigm  used  in  this  study.  However,  the  prog¬ 
ress  presented  in  this  article  demonstrates  how 
mathematical  and  computational  process  model¬ 
ing  can  be  combined  to  provide  a  more  compre¬ 
hensive  account  of  human  performance  under 
the  moderating  influence  of  sleep  deprivation. 

The  integrative  approach  of  this  research 
has  allowed  us  to  present  a  priori  predictions 
regarding  the  impact  of  fatigue.  This  illustrates 
progress  toward  a  major  goal  of  research  in  this 
area,  which  is  to  make  predictions  about  human 
performance  in  novel  contexts,  without  the  ben¬ 
efit  of  empirical  data  for  parameter  fitting.  This 
capacity  is  important  for  both  theoretical  and 
practical,  applied  reasons;  even  approximate 
quantitative  predictions  can  be  crucial  in  applied 
contexts  (Newell  &  Card,  1985).  Explanatory 
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mechanisms  that  provide  only  descriptive 
accounts  of  known  empirical  data,  however,  will 
have  little  predictive  capacity  in  tools  and  sys¬ 
tems  that  are  intended  to  reduce  risk  and  improve 
performance  in  real-world  environments.  Tools 
based  on  descriptive  characterizations  of  cogni¬ 
tive  slowing  and  cognitive  lapsing,  for  instance, 
are  insensitive  to  variations  in  the  task  and  are 
unable  to  generate  predictions  regarding  the 
types,  magnitudes,  and  frequencies  of  errors  that 
will  occur  during  fatigued  performance.  The 
same  is  true  of  mathematical  models  that  quan¬ 
tify  only  relative  alertness. 

Without  generative  computational  mecha¬ 
nisms  for  cognitive  processing,  tools  can  pre¬ 
dict  only  relative  changes  in  performance 
capacity.  Similarly,  cognitive  architectures  tra¬ 
ditionally  have  ignored  the  moderating  effects 
of  fatigue  on  cognitive  processing.  This  article 
provides  a  demonstration  that  our  integrative 
approach,  which  leverages  the  strengths  of  both 
the  biomathematical  and  the  cognitive  archi¬ 
tecture  approach,  moves  us  in  the  direction  of 
achieving  precise,  accurate  predictions  of  future 
performance. 
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